Learning Control Composition in aComplex
نویسنده
چکیده
In this paper, reinforcement learning algorithms are applied to a foraging task, expressed as a control composition problem. The domain used is a simulated world in which a variety of creatures (agents) live and interact, reacting to stimuli and to each other. In such dynamic, uncertain environments , fast adaptation is important, and there is a need for new architectures that facilitate on-line learning. Recently, control composition has shown its potential in reducing the complexity of learning in many domains, permitting more complex tasks to be addressed. Results presented here restate the above claim and also show that the use of a modiied version of the Q-learning algorithm performs better in the non-Markovian sequential decision task proposed. In addition, it is shown that shaping can be successfully used to improve the performance of these learning algorithms in complex environments.
منابع مشابه
The effects of train composition with brain learning approach based on the Educational self-efficacy and written ability of students
Purpose: Present study was conducted with the aim of the effects of teaching method of writing with brain-based learning on educational self-efficacy and written ability of female students in fifth grade of primary school. Materials & Methods: A quasi-experimental method with designing pre-test and post-test with unequal control group was used and to select the sample, the one-stage cluster sam...
متن کاملArtificial Intelligence for Inferential Control of Crude Oil Stripping Process
Stripper columns are used for sweetening crude oil, and they must hold product hydrogen sulfide content as near the set points as possible in the faces of upsets. Since product quality cannot be measured easily and economically online, the control of product quality is often achieved by maintaining a suitable tray temperature near its set point. Tray temperature control method, however, is n...
متن کاملبررسی تأثیر یادگیری مشارکتی بر پیشرفت انشانویسی دانشآموزان شهرستان ساوه
The purpose of this study was to survey the impact of collaborative learning on development of students’ composition and writing skills and its comparison with classic method. Statistical population was students in 7th grade in Saveh in 2011-2012 academic year from whom 140 students (70 males and 70 females) were selected as the sample by cluster sampling. The required data were provided throug...
متن کاملA model for specification, composition and verification of access control policies and its application to web services
Despite significant advances in the access control domain, requirements of new computational environments like web services still raise new challenges. Lack of appropriate method for specification of access control policies (ACPs), composition, verification and analysis of them have all made the access control in the composition of web services a complicated problem. In this paper, a new indepe...
متن کاملSimulation and Control of an Aromatic Distillation Column
In general, the objective of distillation control is to maintain the desired products quality. In this paper, the performances of different one point control strategies for an aromatic distillation column have been compared through dynamic simulation. These methods are: a) Composition control using measured composition directly. This method sufferes from large sampling delay of measuring de...
متن کامل